how to handle duplicate data in pandas data frame